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We provide a uniform law for the weak convergence of additive 
functionals of partial sum processes to the local times of linear frac¬ 
tional stable motions, in a setting sufficiently general for statistical 
applications. Our results are fundamental to the analysis of the global 
properties of nonparametric estimators of nonlinear statistical models 
that involve such processes as covariates. 

1. Introduction. Let xt = v s be the partial sum of a scalar linear 

process {vt}, for which the finite-dimensional distributions of d ~ 1 xi nr i con¬ 
verge to those of X(r). Under certain regularity conditions, we then have 
the finite-dimensional convergence 



( 1 . 1 ) 


where a S R, / is Lebesgue integrable, h n = o{d n ) is a deterministic se¬ 
quence, and C denotes the occupation density (or local time ; see Remark 2.5 
below) associated to X. Convergence results of this kind are particularly 
well documented in the case where {xt} is a random walk [see the mono¬ 
graph by Borodin and Ibragimov (1995)], and have more recently been ex¬ 
tended to cover generating mechanisms that allow the increments of {xt} 
to exhibit significant temporal dependence [Jeganathan (2004), Wang and 
Phillips (2009a)]. 

These more general theorems concerning (1.1) have, in turn, played a 
fundamental role in the study of nonparametric estimation and testing in 
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the setting of nonlinear cointegrating models. The simplest of these models 
takes the form 

(1-2) yt = mo(x t ) + ut, 

where {xt} is as above, {ut} is a weakly dependent error process, and mo is 
an unknown function, assumed to possess a certain degree of smoothness (or 
be otherwise approximate). In a series of recent papers, (1.1) has facilitated 
the development of a pointwise asymptotic distribution theory for kernel 
regression estimators of mo under very general conditions: see especially 
Wang and Phillips (2009a, 2009b, 2011, 2015), Kasparis and Phillips (2012) 
and Kasparis, Andreou and Phillips (2012). 1 

However, there are definite limits to the range of problems that can be 
successfully addressed with the aid of (1.1). In particular, since it concerns 
only the finite-dimensional convergence of L n (a,h n ), (1.1) is suited only 
to studying the local behavior of a nonparametric estimator: that is, its 
behavior in the vicinity of a fixed spatial point. For the purpose of obtain¬ 
ing uniform rates of convergence for kernel regression estimators on “wide” 
domains—that is, on domains having a width of the same order as the range 
of {xi}” =1 - it is manifestly inadequate. [See Duffy (2015), for a detailed 
account.] The situation is even worse with regard to sieve nonparametric 
estimation in this setting—which initially motivated the author’s research 
on this problem—since in this case the development of even a pointwise 
asymptotic distribution theory requires a prior result on the uniform con¬ 
sistency of the estimator, over the entire domain on which estimation is to 
be performed. 

The main purpose of this paper is thus to provide conditions under which 
the finite-dimensional convergence in (1.1) can be strengthened to the weak 
convergence 

(1.3) C f n (a,h n ) C{a) [ f, 

JR 

where C n (a , h n ) is regarded as a process indexed by (/, a)£#xl, and {h n } 
may be random. Results of this kind are available in the existing literature, 
but only in the random walk case, which requires that the increments of {xt} 
be independent, and X to be an a-stable Levy motion [see Borodin (1981, 
1982); Perkins (1982); and Borodin and Ibragimov (1995), Chapter V], In 


1 If {xt} is Markov, then this distribution theory may be developed by quite different 
arguments, without the use of (1.1), see Karlsen and Tjqstheim (2001) and Karlsen, Myk- 
lebust and Tjpstheim (2007), which have spawned a large literature. While we consider 
this approach to the problem to be equally important, our results touch upon it only a 
little, since we work with a class of regressor processes that are typically (excepting the 
random walk case) non-Markov. 
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contrast, we allow the increments of {x*} to be serially correlated, such 
that the associated limiting process X may be a linear fractional stable 
motion, which subsumes the a-stable Levy motion and fractional Brownian 
motion as special cases. Further, we permit the bandwidth sequence {h n } 
to be a random process, subject only to certain weak asymptotic growth 
conditions: this is of considerable utility in statistical applications, where 
the assumption that {h n } is a “given” deterministic sequence seems quite 
unrealistic. Crucial to the proof of (1.3) is a novel order estimate for jC J n (a, 1) 
when / / = 0, which is of interest in its own right. 

The remainder of this paper is organized as follows. Our assumptions on 
the data generating mechanism are described in Section 2. The main result 
(Theorem 3.1) is discussed in Section 3. An outline of the proof follows in 
Section 4, together with the statement of two key auxiliary results (Propo¬ 
sitions 4.1 and 4.2). A preliminary application of our results to the kernel 
nonparametric estimation of mo in (1.2) is given in Section 5. The proof of 
Theorem 3.1 appears in Section 6, followed in Section 7 by proofs of Propo¬ 
sitions 4.1 and 4.2. A proof related to the application appears in Section 8. 
The final two Sections 9 and 10 are of a more technical nature, detailing the 
proofs of two lemmas required in Section 7, and so may be skipped on a first 
reading. 

1.1. Notation. For a complete listing of the notation used in this paper, 
see Section H of the Supplement. 2 The stochastic order notations o p (-) and 
Op(-) have the usual definitions, as given, for example, in van der Vaart 
(1998), Section 2.2. For deterministic sequences {a n } and {b n }, we write 
a n ~ b n if lim^oo a n /b n = 1, and a n x b n if lirn 

n—>• oo /b n € (- 00 , 00 ) \ 

{0}; for random sequences, a n < p b n denotes a n = O p {b n ). X n X de¬ 
notes weak convergence in the sense of van der Vaart and Wellner (1996), 
and X n ~q.d.d. X the convergence of finite-dimensional distributions. For a 
metric space ( Q,d ), loo(Q) [resp., l ucc (Q)] denotes the space of uniformly 
bounded functions on Q , equipped with the topology of uniform conver¬ 
gence (resp., uniform convergence on compacta). For p > 1, X a random 
variable, and /:R^M, \\X\\ P := {ElX^) 1 ^ and ||/|| p := (f R \f\ p ) 1/p . BI de¬ 
notes the space of bounded and Lebesgue integrable functions on R. |_ - J and 
[•], respectively, denote the floor and ceiling functions. C denotes a generic 
constant that may take different values even at different places in the same 
proof; a < b denotes a < Cb. 

2. Model and assumptions. Our assumptions on the generating mecha¬ 
nism are similar to those of Jeganathan (2004)— who proves a finite-dimen¬ 
sional counterpart to our main theorem—and are comparable to those made 


2 The Supplement is available as an addendum to arXiv:1501.05467. 
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on the regressor process in previous work on the estimation of nonlinear coin¬ 
tegrating regressions [see, e.g., Park and Phillips (2001), Wang and Phillips 
(2009b, 2012, 2015); and Kasparis and Phillips (2012)]. 


Assumption 1. (i) {ej} is a scalar i.i.d. sequence. Eq lies in the domain 

of attraction of a strictly stable distribution with index a £ (0,2], and has 
characteristic function ^(A) := Ee lAeo satisfying i/j € L po for some po > 1- 
(ii) {xt} is generated according to 


( 2 . 1 ) 

and either: 


t 

x t ■= y ^v s , 

S =1 


vt • yy^k^-t—ki 

k =0 


(a) a € (1,2], YlkLo W < 00 and 0 := or $k ~ k H 1 1/a Tr k 

for some {vr/ c }/ c >o strictly positive and slowly varying at infinity, with 

(b) H > 1/a; or 

(c) H < 1/a and 52kLo<t>k = 0. 


In both cases (b) and (c), H £ (0,1). 


Remark 2.1. Part (i) implies that there exists a slowly varying sequence 
{Qk} such that 


( 2 . 2 ) 


n 


1/a. 


Qn 


[nr\ 

E et 

f.d.d. 

t =1 


Z a (r), 


where Z Q denotes an a-stable Levy motion on R, with Z a ( 0) = 0. That is, 
the increments of Z a are stationary, and for any r\ < r 2 the characteristic 
function of Z a (r 2 ) — Z a (r\) has the logarithm 


-{r 2 -ri)c\X\ a 


1 + i/3 sgn(A) tan 



where (3 £ [—1,1] and c > 0; following Jeganathan (2004), page 1773, we im¬ 
pose the further restriction that /3 = 0 when a = 1. We shall also require 
that {p/c} be chosen such that c = 1 here, which provides a convenient nor¬ 
malization for the scale of Z a . 


Remark 2.2. To permit the alternative forms of (ii) to be more con¬ 
cisely referenced, we shall refer to (a) as corresponding to the case where 
H = 1/a; this designation may be justified by the manner in which the 
finite-dimensional limit of d~ l xy nr \^ depends on (H,a), as displayed in (2.6) 
below. The statement that H <1/a will also be used as a shorthand for (c), 
that is, it will always be understood that 0 = 0 in this case. 
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We shall treat the parameters (including H and a ) describing the data 
generating mechanism as “fixed” throughout, ignoring the dependence of 
any constants on these. Let {q,} denote a sequence with cq = 1 and 


(2.3) 


f cj), if H = 1/a, 

\ \H — 1/ a\- 1 k H ~ x l a 'K} u otherwise. 


By Karamata’s theorem [Bingham, Goldie and Teugels (1987), Theorem 1.5.11], 
y / q (j> k ~ c k as k — y oo. Set 

(2.4) d k :=k 1/a c k g k , e k \=kdf l , 


and note that the sequences {c k }, {d k } and {e^} are regularly varying with 
indices H — 1/a, H and 1 — H, respectively. Theorems 5.1-5.3 in Kasahara 
and Maejima (1988) yield 


Proposition 2.1. Under Assumption 1 , 

(2.5) X n {r):=^-xy nr \~>X(r), re [0,1], 

CLfi i. cl. cl. 

where X is the linear fractional stable motion (LFSM) 

X(r):= f r (r-8) H -^ a dZ a (s) 

(2-6) ° o 

+ [ [(r-^- 1 /a -(- S )^- 1 /o ]dZ Q (s) 

J — OO 

with the convention that X = Z Q when H = 1/a; Z a is an a-stable Levy 
motion on M, with Z a { 0) = 0. 


Remark 2.3. For a detailed discussion of the LFSM, see Samorodnitsky 
and Taqqu (1994). When a = 2, Z a is a Brownian motion with variance 2; 
if additionally H 1/a, X is thus a fractional Brownian motion. 


Remark 2.4. Excepting such cases as the following: 

(i) a€ (1,2], H > 1/a [Astrauskas (1983), Theorem 2]; 

(ii) a = 2, H = 1/a and Eef] < oo [Hannan (1979)]; and 

(iii) a = 2, H < 1/a and E^ol 9 < oo for some q > 2 [Davidson and de Jong 
(2000), Theorem 3.1]; 

it may not be possible to strengthen the convergence in (2.5) to weak con¬ 
vergence on -^x/0,1]. Weak convergence may hold, however, with respect to 
a weaker topology, and we shall be principally concerned with whether this 
topology is sufficiently strong that 

(2.7) inf X n (r) inf X(r), sup X n (r) sup X(r), 
reM re[0,l] r€[0,l] re[0,l] 
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such as would follow from weak convergence in the Skorokhod Mi topology 
[see Skorohod (1956), Section 2.2.10]. When H = 1 /a, sufficient conditions 
for this kind of convergence—which entail further restrictions on {4>k} than 
are imposed here—are given in Avrarn and Taqqu (1992), Theorem 2 and 
Tyran-Kaminska (2010), Theorem 1 and Corollary 1. However, when H < 
1/a and a £ (0,2), the sample paths of X are unbounded, and thus (2.7) 
cannot possibly hold [see Samorodnitsky and Taqqu (1994), Example 10.2.5]. 
In any case, (2.7) is not necessary for the main results of this paper; it merely 
permits Theorem 3.1 below to take a slightly strengthened form. 


Remark 2.5. In consequence of Theorem 3(i) in Jeganathan (2004), the 
convergence in (2.5) occurs jointly with 

1 n r 

a) ■= — y^J{x t - d n a ) £(a) / /, a <E R 

e n f-d.d. Jr 

for every / £ BI. Here, {£(a)} ag R denotes the occupation density (local time) 
of X, a process which, almost surely, has continuous paths and satisfies 

(2.8) [ f{x)C(x) dx= [ f(X(r))dr 

Jr Jo 

for all Borel measurable and locally integrable /. (For the existence of C, 
see Theorem 0 in Jeganathan (2004); the path continuity may be deduced 
from Theorem 3.1 below.) 



3. A uniform law for the convergence to local time. Our main result 
concerns the convergence 


(3.1) 


£ / n{ a i h"n) '■ — 


1 j. f x t - d n a 
Cnhn \ h n 


^C{a) / /, 


where £h(a, h n ) is regarded as a process indexed by (/, a)£^xl. (J^ - x R 
is endowed with the product topology, & having the L 1 topology, and R the 
usual Euclidean topology.) {h n } is a measurable bandwidth sequence that 
may be functionally dependent on {x t }, or indeed upon any other elements 
of the probability space; it is required only to satisfy: 


Assumption 2. h n £ := \h n ,h n ] with probability approaching 1 

(w.p.a.l), where h n = o{d n ) and h~ [ = o(e n log~ 2 n). 


Define 
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and let BILg denote the subset of Lipschitz continuous functions in Big. In 
order to state conditions on SF C BI that are sufficient for (3.1) to hold, we 
first recall some definitions familiar from the theory of empirical processes. 
A function F: M —R + is termed an envelope for IF, if sup^gj?- |/(x)| < F(x) 
for every x 6K, Given a pair of functions l,u G L 1 , define the bracket 

[l,u] := {/ €E L l \l(x ) < f(x) < u(x),\/x G M}; 

we say that [l,u] is an e-bracket if ||u — Z||i < e, and a continuous bracket 
if l and u are continuous. Let (e, ,F) denote the minimum number of 

continuous e-brackets required to cover F. 

Assumption 3. (i) F C BI has envelope F E BILg, for some j3 > 0; and 

(ii) for each e > 0, N( (e, F) < oo. 

We may now state our main result, the proof of which appears in Section 6. 


Theorem 3.1. Suppose Assumptions 1-3 hold. Then: 

(i) (3.1) holds in l ncc (F x R); 
and if additionally (2.7) holds, then 

(ii) (3.1) holds in £oo{F x M). 

Remark 3.1. The case where h n = 1, .'F = {/} and {xt} is a random 
walk—which here entails H = 1/a and (j>i = 0 for all i > 1—has been stud¬ 
ied extensively: see in particular Borodin (1981, 1982), Perkins (1982) and 
Borodin and Ibragimov (1995), Chapter V. In those works, it is proved (un¬ 
der these more restrictive assumptions on {xf}) that 



on 4o(R x [0,1]), where C(a;r) denotes the local time of X restricted to 
[0,r]. Theorem 3.1 could be very easily extended in this direction; we have 
refrained from doing so only to keep the paper to a reasonable length. The 
principal contribution of Theorem 3.1 is thus to extend this convergence in 
a direction more suitable for statistical applications, by allowing {uj} to be 
serially correlated and the bandwidth sequence {h n } to be data-dependent. 

Remark 3.2. After the manuscript of this paper had been completed, 
we obtained a copy of an unpublished manuscript by Liu, Chan and Wang 
(2014) in which, under rather different assumptions from those given here, 
a result similar to Theorem 3.1 is proved (for a fixed / and a deterministic 
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sequence {/i n }). Regarding the differences between our main result and their 
Theorem 2.1, we may note particularly their requirement that there exist a 
sequence of processes {X*} with X* =g X , and a 6 > 0 such that 

(3.3) sup | X n (r) - X*(r)\ = o a . s .(n~ s ), 

r€[0,l] 

a condition which excludes a large portion of the processes considered in 
this paper, in view of Remark 2.4 above. [The availability of (3.3) permits 
these authors to prove their result by an argument radically different from 
that developed here.] On the other hand, our results do not subsume theirs, 
since these authors do not require vt to be a linear process. 

Although Assumption 3 requires that & have a smooth envelope and 
smooth brackets, it is perfectly consistent with & containing discontinuous 
functions. Indeed, Assumption 3 is consistent with such cases as the follow¬ 
ing, as verified in Section A of the Supplement. [We expect that boundedness 
and / |/(x)||x dx < oo could also be relaxed through the use of a suitable 
truncation argument, such as is employed in the proof of Theorem V.4.1 in 
Borodin and Ibragimov (1995).] 

Example 3.1 (Single function). & = {/} where / £ Big, and is ma- 
jorised by another function F £ BIL^, in the sense that |/(x)| < F{x) for all 
x £ R. This obtains trivially if / is itself in BIL^ (simply take F(x) := |/(x)|), 
but is also consistent with / £ BI^ having finitely many discontinuities (at 
the points {afc}(L], where a*, < afc+i), and being Lipschitz continuous on 
(—oo,ai) U [or-, 0 o); all that is really necessary here is for / to have one¬ 
sided Lipschitz approximants. Importantly, this includes the case where 
f{x) = l{x £ 1} for any bounded interval I. 

Example 3.2 (Parametric family). & = {g(x,9)\9 £ 0} C BILg, where 
0 is compact, and there exists a r £ (0,1] and a g € BILg such that 

\g(x,9) - g(x,9')\ < g{x)\\9 - 9'f 

for all 9,9' £ 0. 

Example 3.3 (Smooth functions). & = {/ £ C T (M)||/| < F}, where 
F £ BIL^ and 

C£(R) := {/ £ BI| 3C f < L s.t. | f(x) - f{x')\ <C f \x - x'\ T Vx,x' £ M} 
for some r £ (0,1] and L < oo. 

4. Outline of proof and auxiliary results. 

4.1. Outline of proof. The principal relationships between the results in 
this paper are summarized in Figure 1. The proof of Theorem 3.1, depicted 
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K) £(a) f f on £°°(^ x R) 
(Theorem 3.1) 

t X. 

Sec. 6 Sec. 6 Sec. 6 


Decomposition £j£(a) £(a) Unif. negligibility of 

(4-1) in £°°(R) £f(a,h n )-£%(a)f f 

^ f f 

Sec. 6 Sec. 6 


Sec. 6 


Remark 2.5 
(Jeganathan, 2004) 


Proposition 4.1 


Proposition 4.2 


Sec. 7 Sec. 7 

II ^ 


Sec. 7 


\ ^ Sec ' 7 
Martingale Lemma 7.3 
decomp. (7.1), 


Lemma 7.5 


Sec. 10 


Lemma 7.1 
(Bercu & Touati, 2008) 


Lemma 7.4 


Sec. 10 


Sec. 10 Sec. 10 


Lemmas Lemma 9.3 
9.1, 9.2, 9.4 


Fig. 1. Outline of proofs. 


in the top half of the figure, proceeds as follows. To reduce the difficulties 
arising by the randomness of h = h n , we decompose 

Cf n {a,h)-m{a) f /l, 

JR . 

where 



(4.2) cp(x) := (1 — |x|)l{|x| < 1} 

denotes the triangular kernel function, and £j?(a) :=jCn(a, 1). (This choice 
of tp is made purely for convenience; any compactly supported Lipschitz 
function would serve our purposes equally well here.) It thus suffices to 
show that Cfi C in £°° (R), and that the bracketed term on the right-hand 
side of (4.1) is uniformly negligible over ( f,h ) £ & x 3%^. 

In view of Remark 2.5 above, the finite-dimensional distributions of Cn 
converge to those of C. The asymptotic tightness of Cn will follow from the 
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bound on the spatial increments 

j n i n 

£%(ai) ~ ££(a 2 ) = — ^2[ip(x t - d n ai) - ip(x t - d n a 2 )\ =: —^gi(x t ), 

6n t =1 6n t= l 

given in Proposition 4.1 below. The bracketed term on the right-hand side 
of (4.1) may be written as 


(4.3) 



x t - d n a\ 

h J 


p(x t - d n a ) / / 


P.. ZJ 


t= 1 


Control of (4.3) over progressively denser subsets of & x is provided by 
Proposition 4.2 below; the conjunction of a bracketing argument and the 
continuity of the brackets suffices to extend this to the entirety of & x 3^ n . 

By construction, both f g i = 0 and f g 2 = 0. The proofs of Propositions 
4.1 and 4.2 may therefore be approached in a unified way, through the 
analysis of sums of the form 

n 

(4.4) S n g:=^2g(x t ), 

t =i 

where g ranges over a class , all members of which have the property that 
f g = 0. Such functions are termed zero energy functions [Wang and Phillips 
(2011)]; we shall correspondingly term {S n g} ge & a zero energy process. Such 

processes are “centered” in the sense that e n ; S n g converges weakly to a 
mixed Gaussian variate [Jeganathan (2008), Theorem 5]; whereas e~ l S n g 

£(o )f 9 if f o. 

Equation (4.4) will be handled by decomposing S n g as 

n— 1 

Sn9 = 'y ^ J^dnkg T J\f n g, 

k =0 

where each M. n kg is a martingale; see (7.4) below. We provide order es¬ 
timates for the sums of squares and conditional variances of the M n kg's 
(Lemma 7.3); by an application of either Burkholder’s inequality, or a tail 
bound due to Bercu and Touati (2008), these translate into estimates for 
the Ai nk g’s themselves. Propositions 4.1 and 4.2 then follow by standard 
arguments. 


4.2. Key auxiliary results. To state these, we introduce the quantity 

(4.5) 11/110 := inf{c 6 R + \\f(X)\ < c|A|^, VA € M} 

for / E BI, f3 E (0,1], and /(A) := f e lXx f(x)dx. It is easily verified that 
H/II0 is indeed a norm on the space BI^j := {/ E BI |||/||0 < 00} (modulo 
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equality almost everywhere). Some useful properties of \\f\\[p] are collected 
in Lemma 9.1 below; in particular, it is shown that Bl^gj contains all / G Big 
for which f f = 0. Define 

(4-6) p H :=±zJL Ali 


noting that (5 H € (0,1] for all H G (0,1), and let || • || r2/3 denote the Orlicz 
norm associated to the convex and increasing function 


(4.7) 


72/3 (*) 


x(e — 1), if x G [0,1], 
g x 2 / 3 _ if x G (1, oo). 


[See van der Vaart and Wellner (1996), page 95 for the definition of an Orlicz 
norm.] A bound on the spatial increments of Cf is given by: 


Proposition 4.1. For every (5 G (0,/3 H ), there exists Cp <oo such that 
sup ||££(ai)-££(a 2 )|| T <Cp\a 1 -a 2 \ 13 . 

ai,a 2 GK ' 


The next result shall be applied to prove that the recentered sums (4.3) 
are uniformly negligible. Since the order estimate given below is of interest 
in its own right [see Duffy (2015), for an example of how it may be used 
to determine the uniform order of the first-order bias of a nonparametric 
regression estimator], we shall state it at a slightly higher level of generality 
than is needed for our purposes here. For & C BI^j, define 

(4.8) 5 n (j3,^) := Halloo + ey 2 (||^||i + H^lb) + e n d n ^ ||^||[/3], 
where \\&\\ := sup /eJr ||/||. 

Proposition 4.2. Suppose f3 G (0 ,/3 H ) and JF n C BI^j with < n c . 
Then 

(4.9) max \S n f\ < p 6 n (P,& n )\ogn. 

If also ||^n||i<l, \\^n\\[p}=o{df L ) and ||^n||oo = o(e n log -2 n), then 

max \S n f\ = o p (e n ). 

/SJL* 

Remark 4.1. As is clear from the proof, if /3 G [/3 H , 1] then (4.9) holds 
in a modified form, with e n dn^ ||^||[/ 3 ] in (4.8) being replaced by 

~n —1 n—1 

E4 (1+ ' s, +4 /2 E fc - 1/2 A (1+W2 

_k=l k =1 

The proofs of Propositions 4.1 and 4.2 are given in Section 7 below. 


\m\py 
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5. A preliminary application to nonparametric regression. Suppose that 
we observe generated according to the nonlinear cointegrating 

regression model 


Ut = m 0 (x t ) + u t , 


where {ut} is some weakly dependent disturbance process. As shown in 
Wang and Phillips (2009b), under suitable smoothness conditions the un¬ 
known function mo may be consistently estimated, at each fixed i 6K, by 
the Nadaraya-Watson estimator 


(5.1) 


rh(x) 


YJLi K h n {x t -x)y t 
ElLi K h n (x t - x ) 


i \ , T,t=i K h n (xt-x)[{m 0 {xt)-m 0 (x)) + u t } 
° [>+ YZ.iKh n (xt-x) 


where Kh(u ) := h l K(h 1 u), and K £ BI is a positive, mean-zero kernel 
with f R K = l. 

Now consider the problem of determining the rate at which rh converges 
uniformly to mo- As a first step, we would need to obtain the uniform rate 
of divergence of the denominator in (5.1); it is precisely this rate that the 
preceding results allow us to compute. By Theorem 3.1, 


(5.2) 


— y Z K hn(xt- d n a) 


t=1 


C(a) 


in ^oo(M), provided ¥{h n € J^n] —>• 1, and K satisfies the requirements of 
Example 3.1 (which seems broad enough to cover any reasonable choice of 
K). Since a i-A C(a) is random—being dependent on the trajectory of the 
limiting process X —we now face the problem of identifying a sequence of 
sets on which the left side of (5.2) can be uniformly bounded away from 
zero. A natural candidate is 


A £ n := {x € K| Cnid^x) > e}, 


where e > 0. C n is trivially bounded away from zero on this set, whence 

-l 


(5.3) 


sup 
xe As, 


J2 K h n ( Xt ~ X ) 


U=i 


More significantly, for any given 8 > 0, we may choose e > 0 such that 


(5.4) 


lim sup P 

n—> oo 




see Section 8 for the proof. That is, e > 0 may be chosen such that AF n 
contains as large a fraction of the observed trajectory {xt}” =1 as is desired, 






UNIFORM CONVERGENCE TO LOCAL TIMES 


13 


in the limit as n —> oo. [Were we to allow s = e n —> 0, we could permit 
5 = 5 n —> 0 here, but the order of (5.3) would necessarily be increased.] 

Note that the sample-dependence of A ® is necessary for it fulfill two roles 
here, by being both “small” enough for (5.3) to hold, but also “large” enough 
to be consistent with (5.4). If A e n were replaced by a sequence of determin¬ 
istic intervals (or sets, more generally), then the maintenance of (5.3) would 
necessarily come at the cost of violating (5.4). For example, when xt is a 
random walk with finite variance (a = 2), the “widest” sequence of intervals 
[— a n ,a n \ for which (5.3) holds is one for which a n = o(n 1//2 ): but in conse¬ 
quence, the fraction of any trajectory {xt}™ = i falling within such an interval 
will converge to 0, as n —> oo [see Remark 2.8 in Duffy (2015)]. 

In this respect, the availability of Theorem 3.1 allows us to improve upon 
the analysis provided in an earlier paper by Chan and Wang (2014)— and, 
in the random walk case, that of Gao et al. (2015)— who obtain uniform 
convergence rates for m n on precisely such intervals [see Duffy (2015) for 
further details]. We expect that it would also play a similarly important role 
in the derivation of uniform convergence rates for series regression estimators 
in this setting, by ensuring the eigenvalues of the design matrix diverge at an 
appropriate rate, when attempting to estimate mo on a sequence of domains 
that contains most of the observed {xt}” =1 . 

6. Proof of Theorem 3.1. We shall prove only part (i) of Theorem 3.1 
here; the relatively minor modifications required for the proof of part (ii) 
are detailed in Section B of the Supplement. Let M < oo be given; it suffices 
to prove that (3.1) holds in £ 00 [—M,M], To simplify the exposition, we shall 
require that h n E always; the proof in the general case (where this occurs 
w.p.a.l) requires no new ideas. The proof involves three steps: 

(i) show that Cf(a) C(a), using Proposition 4.1; 

(ii) deduce C'ffa, h n ) C{a) f M f for / E BIL^, using a recentering, Propo¬ 
sition 4.2 and the Lipschitz continuity of /; 

(iii) extend this to all / E & C BI, where & satisfies Assumption 3, via 
a bracketing argument. 

(i) Let (p be the triangular kernel function, as defined in (4.2) above, 
and set /3q := /3#/2. Recall that Ch{a) := £h(a, 1). By Proposition 4.1 and 
Theorem 2.2.4 in van der Vaart and Wellner (1996), 

sup \Cf(a') - Cf(a)\ 

{a,a'GM||a—a/|<<5} 

< rlog 3/2 (Me-^ 0 ) de + (Dog 3 / 2 (Mr 2 /*) < C M S 1/2 , 

Jo 
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whence Ct, is tight in Thus, in view of Remark 2.5, 

(6.1) C*{a)~*C{a) 

in loo[—M, M] [see van der Vaart and Wellner (1996), Example 2.2.12], 

(ii) Now let / £ BIL^; we may without loss of generality take / to be 
bounded by unity, with a Lipschitz constant of unity. For the subsequent 
argument, it will be more convenient to work with the inverse bandwidth 
b\=h ~ l . Define 

J> n :={r 1 |/jGJf n }= \b n ,b n \ ■■= 

and let f( a> fy(x) := bf[b(x — d n a)], for (a, b) GlxR + . Take C n := [—n 7 ,n 7 ] x 
let c 6.’ n C C n be a lattice of mesh n~ s , and let p n (a,b) denote the pro¬ 
jection of (a, b) £ C n onto a nearest neighbor in (with some tie-breaking 
rule). The following is a straightforward consequence of the Lipschitz conti¬ 
nuity of / (see Section C of the Supplement for the proof). 


Lemma 6.1. For every 7 > 1, there exists 5 > 0 such that 
1 n 

SUP T" f{a,b){ x t) - fp n (a,b) ( x t) | =o p { 1). 
((l,b)^Cn n l = \ 


By taking 7 > 1, we may ensure that C n D [— M, M) x for all n suffi- 
ciently large. Thus, for ^ (a) := and p f := f R f, 

( 6 . 2 ) sup |-^n( a , fr _1 )~ hf^n( a )i 

(a,b)e[-M,M]x3g„ 

1 U 

< SUp — V[/(a,6)(^t) -FfF{a)(x t )] 

(a, 6 )gC„ t=1 


(6.3) 


1 

< sup — 

(a,b)G%. e n 


n 

£[/<«. b)(xt) - fJ>f<P(a)(x t )] 
t =1 


+ Op(l) 


1 

= sup — 

36%. e n 


t =1 


+ °p(i)) 


by Lemma 6 . 1 , and we have defined := {f( a ,b) ~ t l fP(a)\( a ,b) £ & ’ n }• It is 
readily verified that ||g||i = 1, < n 1+7+2<5 , and using Lemma 9.1(h), 

sup || 5 ||[/ 3 ] <kn 3 = o{d^), sup H^Hoo < b n < e n log ' 2 n. 

36%. 36%. 


Thus, y n satisfies the requirements of Proposition 4.2, whence (6.3) is o p (l). 
Hence, in view of (6.1), 


(6.4) 


£n{a,h n ) C(a) [ f 

J R 
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in 4o[—M,M], for every / € BILg. 

(iii) Finally, for / G BI define the centered process 

:=Cl(a,h n ) - C*{a) [ /. 

JR 

For a given e > 0, let {£fc, u fc}fLi denote a collection of continuous L 1 brack¬ 
ets that cover 3F, with ||Uk — 4111 < e; the existence of these is guaranteed 
by Assumption 3. We first note (see Section C of the Supplement for the 
proof) 

Lemma 6.2. Under Assumption 3, the brackets {h,Uk}^ = i can be cho¬ 
sen so as to lie in BIL^. 


For each / G & , there exists a k £ {1,... ,K} such that Ik < / < Uk, 
f R (u k -/)<£, and 


1 

Vn(f,a) 

e n 


1 fx t - d n x 


h. 


-Uk 


h ri 



f 


<is n (uk,a) + £%(a) f ( u k -f )• 

JR 

Taking suprema, 

sup v n (f, a) < max sup v n (uk,a) + £ sup £„(a) 
l<k<K ae[-M,M] ae[-M,M] 

= e sup C%(a) + o p (l) 
ae[-M,M] 

with the second equality following by (6.4), since we may take Uk G BIL^ by 
Lemma 6.2. Applying a strictly analogous argument to the lower bracketing 
functions, Ik, we deduce that 

(6.5) sup \u n (f,a) \ <£ sup |££(a)| + o p (l) = o p (l), 

if,a)e^x[-M,M] ae[-M,M] 

whence (3.1) holds in £oo[—M,M], in view of (6.1). 


7. Controlling the zero energy process. The proofs of Propositions 4.1 
and 4.2 rely on a telescoping martingale decomposition similar to that used 
to prove maximal inequalities for mixingales [for a textbook exposition, see, 
e.g., Davidson (1994), Sections 16.2-16.3], which reduces S n f to a sum of 
martingale components. In order to pass from control over each of these 
components to an order estimate for S n f itself, we shall need the following 
results, the first of which is a straightforward consequence of Theorem 2.1 
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in Bercu and Touati (2008), and the second of which is well known. For a 
martingale M := {Mt }” =0 with associated filtration Q := {Qt}t=oi define 

n n 

(7.1) [M] := £(M t - M f _i) 2 , (M) := ^E[(M f - M t -i) 2 \Gt-i\- 

t= l t=i 

We say that M is initialised at zero if Mo = 0. Let || • || T1 denote the Orlicz 
norm associated to t\(x) := e x — 1. 


Lemma 7.1. Let {0 n } denote a sequence of index sets, and {K n } a 
real sequence such that #0 n + K n < n c . Suppose that for each n G N, k G 
{1,..., K n } and 6 G @ n , M nk {6) is a martingale, initialised at zero, for which 

( 7 - 2 ) ^nk '■= max{||[M nfc (0)]|| V ||(M nfc (0))|L } < oo. 


Then 


max 

6>G0n 


K n 

E 

k =1 


M nfc (0) 



logn. 


Lemma 7.2. Let Z be a random variable. Then: 

(i) \\Z\\ p <p\ l ^a for allp&n, if and only if \\Z\\ Tl < a; 

(ii) || Z\\ 2p < (?>p)\ 1/2p o for all p€ N, if and only if ||Z|| r2/3 < a. 

The proofs of Lemmas 7.1 and 7.2 appear in Section D of the Supplement. 


7.1. The martingale decomposition. For a fixed / G BI^j, it follows from 
Lemma 9.3(ii) below and the reverse martingale convergence theorem [Hall 
and Heyde (1980), Theorem 2.6], that 

l|Et/(*t+fc)ll 0 o £ 4 (1+/3) 0, K t _ k f(x t ) 4E/(x t ) + 0 

for each t > 0 as k -A oo; here E t f(xt+k) '■= ^[f(%t+k) [-F^], f° r J~s := 
a ({e r }* =s ). Because {f(xt)} is asymptotically unpredictable only in the “for¬ 
wards” direction, we truncate the “usual” decomposition at t = 0 , writing 

t 

f(xt ) = ^[E t _ fc+ i/(xt) - E t _ k f{x t )] + E of(xt)- 

k =1 

Performing this for each 1 < t < n gives 

n 

^2 = E of(xi) + [f(x i) - E 0 /(xi)] 

t= l 


+ Eo/(a? 2 ) + [/(x 2 ) - Ei/^2)] + pEi/(x 2 ) - E 0 /(x 2 )] H- 

+ E 0 f(x n ) + [f(x n ) - E n __i/(x n )] + [E n _i f(x n ) - E n - 2 f(x n )\ 
H-L [El f(x n ) -E of(x n )}- 
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Defining 

(7.3) i k tf := E t f(x t+k ) - E t _i f(x t+k ) 

and collecting terms appearing in the same “column” of the preceding dis¬ 
play, we thus obtain 

n n n —1 n 

s n f=j2f( x t)=J2 E o f(x t ) + EE [E t-kf(x t ) -Et_ fe _i f(x t )] 

t =1 t= 1 fc=0t=fc+l 


(7.4) 


n n—ln—k 

= + Y ^[Etfixt+k) - ^t-if(x t+ k)} 

t= 1 fc=0 t=l 

n n— In—k 

= y fi o f(x t )+ Y h f 

t =1 k =o t =1 


n— 1 

= A7„/ + ^M nfc /, 

/c=0 

where 

n n—/c 

Nnf -=Y Eo/( X t)> Mnkf -=Y^ kt f- 

t= 1 t=l 


A bound for ||A/’ n /|| 00 is provided by Lemma 9.3(h) below. {£&(/, 
by construction, forms a martingale difference sequence for each k, and so 
control over each of the martingale “pieces” M n kf will follow from control 
over 


U nk f ■= 


n—k 


[Mnkf\ = Y&f> 

t =1 


V nk f--= (M nk f) = Y E t-l£tf, 

t =1 


in combination with either Burkholder’s inequality [Hall and Heyde (1980), 
Theorem 2.10] or Lemma 7.1 above, as appropriate. 


7.2. Proofs of Propositions 4-1 cmd, 4-%- Define 

n 

c„(A/):=ll/llo„ + ll/lli + ll/ll|„)EV (1+ '’ ) 

and 


t= l 


II r ||2 


vlk(PJ) ■= < 


+ \\f\\le n , if fc€{0 ,...,/c 0 }, 


,-(1+2/3),, , l|2 


+ e _7lfc ||/||f], 


if k G {ko + 1 ,.. .,n — 1}. 

The following provides the requisite control over the components of (7.4). 
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Lemma 7.3. For any f3 E [0,1], 

(7-5) \\Mnf\\oo<^n(P,f), 

and for all 0 < k < n — 1, 

(7-6) H^nfc/llri V ||V n fc/|| ri < &nkiPi /)• 

The proof of (7.6), in turn, relies upon the following. 


Lemma 7.4. For every k E {0, ... ,n — 1}, t E {1, ... ,n — k} and (3 E (0,1] 

n—k—t 

Ultf Woo + E ll E 4t +S /lloo^^(A/). 


The proofs of these results are deferred to Sections 9 and 10. We shall 
also need the following, for which we recall the definition of 6 n (/3,&) given 
in (4.8) above. 


Lemma 7.5. If f3 € (0 ,/3 H ) and C BI^] ; then there exists a Cp < oo 
such that 


n —1 

sup (/?,/) + V'supo’nfe^,/) < Cp8 n {P,^). 


The proof appears in Section D of the Supplement. We now turn to: 

Proof of Proposition 4.1. Let g € BI^j. Burkholder’s inequality, 
and Lemmas 7.2(i) and 7.3 and give 

\\M nk g\\ 2 p < h]l^ p \\U nk g\\ l p 12 < (b 2p ■ p\) 1/2p a nk (/3,g) < {3p)\ 1/2p a nk {/3,g) 

for every p E N, where b 2p depends on p in the manner prescribed by Burkholder’s 
inequality. Hence, \\M nk g\\T 2/3 <v nk ((3,g) by Lemma 7.2(h). Then by (7.4), 
Lemma 7.3 and Lemma 7.5 (taking = { g }) 

n—l 

(7.7) ||«5 n 3|| r2/3 < II^IU+ Ell^lk /3 < C5 n {/3,g) 

k =o 

for some C < oo depending on (3. 

For ai,a 2 E M, set A := \a\ — a 2 \ and define 

^[ai.aa]^) : = ¥>(® ~ ^ai) - <p{x - d n a 2 ). 

Let (3 G (0,/3 h ). Since ip is bounded and Lipschitz, 

ll¥’[a 1) o 2 ]l|oo < (d n A) A 1 < A /j ; 
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and further, since ip is bounded and compactly supported, 

\\V[ai,a 2 ]\\p < 2 ll^[ai,a 2 ]ll(y^ 1_ lp £ d&A?, 

for p G {1,2}. Finally, by Lemma 9.1 (iii), 

11 </?[«!,a 2 ] II [/3] ^d^A 13 . 

Then by (7.7) and the definition of 5 n (/3,<^), 

ll^n( a l) — ^n( a 2)llr 2 /3 = e n ||'S'n¥ , [oi,a 2 ] l|r 2 / 3 

KCie-^-d^ + d^-d^) 

<CA?, 

for some C depending on /3; here we have used the fact that since (3 < f3 H < 
{en l ^ 2 dn} is regularly varying with index H(f3 — ^jf-) <0. □ 


Proof of Proposition 4.2. In view of Lemmas 7.3 and 7.5, we have 


max \M n f\ < max q n (/3, f) < p 5 n (/3, & n ), 


and by an application of Lemma 7.1, 

71—1 


max 

/GJ^n 


Y, M nkf 


k=0 


<p<5 n (/3, Jvjlogn. 


Thus, (4.9) follows from (7.4). 

For the second part of the result, note that under the stated conditions 
on ,^Vj, 

Wnh < ll^n||^ 2 ||^n|li /2 = o[ey 2 log _1 n], 

whence 

e~ 1 d n (l3,^ r n) = o p (}og~ 1 n) + d~ l3 o p (dd) = o p (l) 
whereupon the result follows by (4.9). □ 


8. Proof of (5.4). Let /i n (a) := ^ Y^t=\ ^-{d n lx t — °} and A*( a ) : = 
f® £(x)dx. It is shown in Section E of the Supplement that 

( 8 . 1 ) 

in foo(®1), jointly with the convergence in Theorem 3.1. 

Let T(x) := l{.x < e} and C n (a) := C!^(a,h n ). We first note that 


1 

n 


]T 1 {x t i A £ J 

t= 1 


1 -y ^ f 

- s }Tl{C n {d~ 1 xt)<e}= / T(C n (a))dn n (a) 

n-ti J r 
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by definition of A £ n and /d n . We shall now suppose that C n ^4 C n in £ UCC (R), 
and \± n '"—t /x in 4>o(R)> as may be justified [in view of Theorem 3.1 and ( 8 . 1 )] 
by Theorem 1.10.3 in van der Vaart and Wellner (1996); let flo C D denote 
a set, having Pflo = 1, on which this convergence occurs. Define 

{ 1, if x < e, 

e _1 (2e — x), if x G (e, 2e), 

0, if x > 2e. 

Then, fixing an q, 

[ T(£"(a))d<(a)< [ T(£"(a))d<(a) = [ F n (a)d^(a), 

J ]R JR. J R. 

where F n (a) := (T o £^)(a). Now let [c, d\ be chosen such that n w (d) — 
H u {c) < e. Since T is uniformly continuous, F n (a) —lF(a) := (T o C ul ){a) 
uniformly over a G [c, d] , whence 

[ F n (a)dHn(a) < [ d// n (a) + [ F(a) d/x£(a) + sup |F n (a) - F(a)| 

Jr J[c,d] c [c,cZ] aG[c,d] 


—>■£+/' F(a)d/r^(a) 

•7[ c ,d] 

< £+ [ F(a) d/jF(a), 

Jr 

where the convergence follows by the Portmanteau theorem [van der Vaart 
and Wellner (1996), Theorem 1.3.4], since F is continuous. 

Thus, 

limsupp| — 1 {xt A £ } > 5 )■ < limsupP^ f T(C n (a)) d/d n (a) > d\ 

n—>oc I Tl |Y| I n->oo (,/R J 


( 8 . 2 ) <P7e+ / T(C(a)) d//(a) > 5 

{ Jr 

<p|e + j l{£(a) < 2 e}£(a) da > <5j>, 

where noting that £ is the density of //, the final inequality follows from 
/ T(£(a)) d/x(a) < / l{£(a) < 2 e} d/x(a) = / l{£(a) < 2e}£(a) da. 

J ]R «/K. «/]R 

Finally, 


£ + f l{£(a) < 2 £}£(a) da ^4 f l{£(a) = 0 }£(a) da = 0 

Jr J ]R 

as £ —>• 0 , by dominated convergence, and so £ > 0 may be chosen such that 
the right side of ( 8 . 2 ) is less than 5. 
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9. Results preliminary to the proofs of Lemmas 7.3 and 7.4. Our ar¬ 
guments shall rely heavily on the use of the inverse Fourier transform to 
analyse objects of the form Ktf(xt+k), similarly to Borodin and Ibragimov 
(1995), Jeganathan (2004, 2008) and Wang and Phillips (2009b, 2011). Pro¬ 
vided that / G BI and Y has an integrable characteristic function i\)y i the 
“usual” inversion formula 

(9.1) E/(y 0 + Y) = ^~ [ /(A)e- i ^°Ee- iAY dA 

for yQ G M, is still valid, even when /(A) = j f(x)e lXx dx is not integrable; 
these conditions will always be met whenever the inversion formula is re¬ 
quired below. The following provides some useful bounds for /. 


Lemma 9.1. For every / G BI and (J G (0,1], 

(i) |/(A)| < (|A|^||/|| [/S] ) A H/llr; 

(h) if J f = 0 , then 

||/||[/j] <2^ inf f \f(x-y)\\xfdx 

Jr 

and so BI [/3] D {/ G BIg| / f = 0}; 

(iii) if f(x) := g(x — ai) — g(x — 0,2) for some ai,a,2 €.M., then 

ll/ll[fl<2 1 -' S |ai-<i2l' S |l»lli. 


Let if := a({e r }(, =s ), noting that Ffl _IL J 7 !!* for si < S 2 < S3 < S 4 . For 0 < 
s < t, we shall have frequent recourse to the following decomposition: 

t t 00 t—s i 

(9.2) x t = ^ 2 v t = j 2 Yl0 i£k ~ i =: x °-ht + Y Y fa =:x *-i ,t + x 's,t,t > 

k=l k=l 1=0 i =0 j =0 


where t _LL x' stt and x*_ 11 is JFfJ, -measurable. 3 Defining a* := 0 r Ab 

we may further decompose x' s t t as 

t r t 

(9.3) x s t t = 'y ^ at-i£i = y ^ at-i£i + y ^ at-i£i =: x s r i + av+i,t,ti 

i=s i=s i=r +1 


where x’ s r t is TJ-measurable, and x' r+111 is J^ +1 -measurable. The following 
property of the coefficients {a, } is particularly important: there exist 0 < a < 
a < oo, and a ko G N such that 


(9.4) 


a < 


inf inf c. 1 
k 0 +l<k\k/2\<l<k 


I a l\ < 


sup sup c k 1 \ai\<a. 
k 0 +l<k \k/2\<l<k 


is weighted sum of since these weights are not important for our 

purposes, we have refrained from giving an explicit formula for these here. 
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This is an easy consequence of Karamata’s theorem. Throughout the re¬ 
mainder of the paper, k o refers to the object of (9.4); it is also implicitly 
maintained k$ > 8po for po as in Assumption l(i). 

Having decomposed xt into a sum of independent components, we shall 
proceed to control such objects as the right-hand side of (9.1) with the aid of 
Lemma 9.1 and the following, which provides bounds on integrals involving 
the characteristic functions of some of those components of xt- Recall that 
Assumption l(i) is equivalent to the statement that 


(9.5) 


log^(A) = — |A|“G(A) 


-i/3sgn(A)tan^^ 


for all A in a neighborhood of the origin, where G is even and slowly varying 
at zero [see Ibragimov and Linnik (1971), Theorem 2.6.5]. Here, as through¬ 
out the remainder of this paper, a slowly varying (or regularly varying) 
function is understood to take only strictly positive values, and have the 
property that G( A) = G(|A|) for every A G M. 


Lemma 9.2. Let p G [0,5], q G (0,2] and z\ , z 2 € M + . Then: 

(i) there exists a 71 > 0 such that, for every t> 0 and k > k$ + 1 , 

(zi|A| p A * 2 )|Ee“ iA *i+ 1 .*+M+‘| dA < ^i4 (1+p) + z 2 e~^ k 

1 

and if F(u) x G p ^ a {u) as u —>• 0, 


h 

J R 


/ 

JR 


(zi|afc| p |A| p+l? F(afcA) A ^ 2 )|lEe lXx t+i,t+k,t+k | dA 


< zi k- p / a d- {1+q) + z 2 e~^ k - 
(ii) for every t> 1 , k>ko + l and s G {ho + 1 ,... ,t}, 

f | Ee - i M- i +i,t-i,t+*|dA< —dj 1 . 

Jr c k-\-s 

The preceding summarizes and refines some of the calculations presented 
on pages 15-21 of Jeganathan (2008). It further implies: 


Lemma 9.3. Let f G BI. Then: 

(i) for every t> 0 and k>ko + l 

E t |/(x m )|<4 1 ||/|| 1 ; 

(ii) if in addition f G Bl^j, then for every t> 0 and k>ko + l, 

|E./fe + i)l<e-’‘‘||/|| 1 +<iA 1 + ' ,> ll/llM- 
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For the next result, define 

i?(zi,z 2 ) :=E[e _izieo -Ee~ i21£0 ][e _iZ2£0 -Ee" i22£0 ]. 

Lemma 9.4. Uniformly over z\,Z 2 € K, 

mzi,Z 2 )\ < [\zi\ a G( Zl ) A l] 1/2 [\z 2 \ a G(z 2 ) A 1] 1/2 , 
where G(u ) x G(u) as u —> 0. 

Proofs of (9.1), (9.4) and the preceding lemmas are given in Section F of 
the Supplement. 

10. Proofs of Lemmas 7.3 and 7.4. 

Proof of Lemma 7.3. By Lemma 9.3(ii), 

ko n 

K/|< J>o/(*t)|+ E l E o/(^)l 

t= 1 t=ko~\-l 

n 

< 11/11=0+ £ [^“II/IIi + A'^II/IIm]. 

t=/c0 + l 

whence (7.5). Regarding (7.6), it follows from repeated application of the 
law of iterated expectations that 

n—k n—k 

nv nk f\ p <p!-e- E e [Ea-i (fktj )• 

^ 1 = 1 tp —i — tp —2 

( 10 . 1 ) 

( n—k—tp— i 

Ult p .J\\oo + E ll E tp-i-l^,tp- 1 +-/lloc 

S =1 

more details of the calculations leading to (10.1) are given in Section G of 
the Supplement. By Lemma 7.4, the final term on the right is bounded by 
Ca^kiPif)- Proceeding inductively, we thus obtain 

E| V nk f\ p <pl-Cr<T%(P,f), 

whence the required bound follows by Lemma 7.2 (i). An analogous argument 
yields the same bound for U. n \.f. □ 

Proof of Lemma 7.4. We shall obtain the required bound for E 4 £| t+s f 
by providing a bound for E t - s £,l t f (for s £ {1 ,... ,f}) that depends only on 
k and s (and not t ), separately considering the cases where: 
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(i) k £ {/co + 1 ,..., n — i}; and 

(ii) ke{0,...,k o }. 

(i) Recall the decomposition given in (9.2) and (9.3) above, applied here 
to reduce xt+k to a sum of independent pieces, 

_ * I / | / | / 

x t+k ~ x O,t+k + x l,t-l,t+k + x t,t,t+k T x t+l,t+k,t+k 

= X 0,t+k + + x f+l,i+M+k 

with the convention that x) t—1 = 0 if t = 1, so that by Fourier inversion, 

Zktf = E tf{xt+k ) - E t -if(x t+k ) 

(10.2) = — f f(\)e~ iXx h+k e - iXx 'i,t-i,t+k 

2?r Jr 


Then 


e kt f= 


( 2 vr) s 


(10.3) 


x [ e -iA a k 6t _ Ee ~i\a k et ]E e -i^ + 1 ,t+fc,t+fc dA. 

f(\ 1 )f(X2)e~ i ^ Xl+X ' 2 ' >x °’ t + k e~ i( ' ) ' 1+> ' 2 ' >x ' 1 ’ t - 1 ’ t + k 

x )e —i '^ iafc£t _ Ee~ iAiafc£t ] [e —iA2afc£t — Ee —iA2afe£t ] 
x Ee~ lXlx ' t + 1 ’ t + k ’ t + k Ke~ lX2X ' t + 1 ’ t + k ’ t + k dAi dA 2 . 


Now suppose s G {A: + 1,... ,t}. Taking conditional expectations on both 
sides of (10.3) gives 




1 


( 2 vr) 


f(Xi)f(X2)e~^ Xl+X2 ^ x ^ t + k e~ l( ' Xl+X2 ^ x ' 1 ’ t - s < t + k 

x E e - i( ^ 1 +^ 2) ®t-.+ i ,t- i ,*+fc .^(Aia fc ,A 2 o fc ) 
x Ee -lAlx *+ 1 ’ t + fe > t+fe Ee -lA23: *+ 1 ' t + fe ’ t + fe dAi dA 2 , 


where we have defined 


0(zi,z 2 ) :=E[e- i2l£0 -Ee- i2l£0 ][e- izi£0 -Ee- i22£0 ] 
for zi,z 2 eK, and made the further decomposition 

x l,t—l,t~\~k X 1 ,t-s,t-\-k "I” x t—s+l,t—l,t+k 

with the convention that x\ t _ s t+k = 0 if s = t. Then, using (9.4), Lemma 9.4 
and \ab\ < \a\ 2 + |6| 2 , we obtain 

Kt-sZltfZ [[ |/(Ai)/(A 2 )| 

JJ R 2 

x [|«fcAi|"( 5 (afcAi) A l] 1 // 2 [|afcA 2 |"G(afcA 2 ) A l ] 1 ^ 2 
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(10.4) 

x |Ee -1 ( Al+A2 ' ):r *- s + 1 ’ t - 1 ’‘+fc | 
x |Ee _iAlx *+ 1 ' t + fc '*+ fc | |Ee _iA2X *+M+M+fc | dAi dA 2 

< /|/(A 1 )| 2 (|a fc r|A 1 rG(a fc A 1 )Al)|Ee- iAl <+M+M +fc | 

JR 

(10.5) 

x f |Ee _i(Al+A2)x <-«+ 1 . t - 1 > t + fc | dA 2 dAi, 

Jr 

where we have appealed to symmetry (in Ai and A 2 ) to reduce the hnal 
bound to a single term. By a change of variables and Lemma 9.2(h), 


| Ee -i( A i+ A 2 )<_ s+M - M+fc | d A 2 


( 10 . 6 ) 


f |Ee" iA < 

Jr 






-d s , 


while Lemma 9.1 (i) and then Lemma 9.2(i) give 


/ |/(A)| 2 (|a fc | Q |ArG(a fc A)Al)|Ee- iA 3 : ‘+M+M + /o|dA 

Jr 

(10.7) < f [(|a t r|A|"+ 2 ' , G(a i A)ll/llf sl ) A H/BlEe-'^+M+M-H.I dA 

J M 

<A.- 1 d- (1 + 2 ®||/|| 2 J| +e-A‘||/||2, 

Together, (10.5)-(10.7) yield 

(10.8) >i7 1 (V 1 <ij <1+2 ' ,) ||/|| 2 M + e-7> l ||/|| 2 ). 

Ck+s 1 J 

When s € {1,..., k}, (10.4) continues to hold, whence 

Ei -.(If < i^|/(A)|(|A(“/ 2 G 1 / 2 ( at A) A l)|Ee- lir '<«.<+*..M|dA) 

<( [ [(l«r /2 |A| W2+ ' ,) G 1/2 (o i A)ll/ll M )A||/|| 1 ) 

\Jr 

\ 2 

(10.9) x |Ee _lAXt + 1 ’ t + fc ’ t + fc | dA j 

<(fc- 1 / 2 4 (1+ ' S >||/|| [ffl +e-A*||/|| 1 ) 2 

<<i- 1 (V 1 4 ,1+2 »||/|| 2 J] +e-i«‘||/|| 2 ) 


by Lemmas 9.1 (i) and 9.2(i); in obtaining the hnal result, we have used the 
fact that s < k to replace a d^ 1 by dj 1 . Since {c*,} is regularly varying and 
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k > ko + 1, it follows from Potter’s inequality [Bingham, Goldie and Teugels 
(1987), Theorem 1.5.6(iii)], that 

k n n 

'52 d s 1 + Y = e n, 

s=l s=k +1 ^k+s s=l 

with the final bound following by Karamata’s theorem. As noted above, 
since the bounds (10.8) and (10.9) do not depend on t, they apply also to 
E t+s f ■ Hence, in view of the preceding, 

n—k—t 

E (V 1 4 (1+2S) ||/||f fl +6-^11/111) 

S=1 

<c„(V 1 4 (1+2 ' J) ||/||f a +e-T‘||/t|?). 

Turning now to ||^f t /||oo) n °t e that (10.2) still holds, with the convention 
that x\ t t-i,t+k = 0 if t = 1. Thus, again by Lemmas 9.1 (i) and 9.2(i), 

II&/IL < (jl|/(A)||Ee- |Al i + ...«..«|dA) 

< (j([|A|' 5 «/» M A||/l|l]|Ee- 1 ^ + 1 ..M..M|dA) 

< (ll/ll|U]4 (1+,J) + ll/llie" 11 *) 2 

<d- 2(1+ ®||/|| 2 M +e-T‘||/||? 

<e„(V<i- <1 + 2 »||/||^ ]+e -T.‘||/||5); 

where the hnal bound follows because k <n, and so d~ k 1 < k^nd^ 1 = k~ 1 e n . 

(ii) When sG{l,...,fco}, the crude bound E 4 _ s Zltf~ II/IIL suffices, since 
ko is fixed and finite. On the other hand, if s € {ko + 1,... ,t}, we have by 
Jensen’s inequality and Lemma 9.3(i) that 

E t-sCltf < Et- a (Etf(x t+k ) - E t _i/(x m )) 2 < E t - s f 2 (x t+k ) < d~ l \\f\\l 
Then, by Karamata’s theorem, 

n—k—t ko n—k—t 

Y E tilt+sf<Y^lt+sf + Y ^h+sf 

5—1 5 — 1 S=kQ + l 

n—k—t 

<11/11^ + 11/112 E d > 1 

S=fco + l 

<\\f\\ 2 oo + \\f\\W 

Regarding ||^ t /|[oo, the bound \\Cl t f\\oo < II/IIL obtains trivially. □ 


n—k—t 


Y d ° 1 + Y 


7-1 


5=1 


s=k -hi 


Ck-\-s 
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